Speech prosody in phonetics and technology
نویسنده
چکیده
As features unique to spoken language, speech prosody plays an important role in human communication. Although the acoustic features of speech are viewed most frequently in a frame-byframe manner, this is not always appropriate for prosodic features, since they are tightly related to higher level linguistic information, such as syntactic and discourse structures, and spread to wide time spans, such as syllables, words, and phrases. In order to handle the situation, models for prosody have been developed. Among many models, the generation process model of fundamental frequency contours is attractive, since it can relate well to the linguistic information of utterances. The model was successfully applied to hidden Markov model (HMM) based speech synthesis and a listening test to determine the (perceptual) categorical boundaries of Japanese accent types.
منابع مشابه
(what Is) the Contribution of Phonetics to Contemporary Speech Synthesis Research
Kurzfassung: Recent advances in speech technology have significantly reduced the necessity for traditional phonetic system components or phonetic expertise, e.g. rule-based prosody models. We therefore need to ask the question, whether and how phonetics ought to play a role in ongoing and future speech synthesis development. The answer can be derived directly from a global analysis of the weakn...
متن کاملPROSICE: A spoken English database for prosody research
Prosody the study of the intonation, stress and rhythm of speech is now assuming a greater importance in phonetics, phonology and speech technology than ever before. Once regarded as subservient to studies of segmental structure, it is now being seen as providing the ‘framework’ which holds different levels of phonetic description together. The recent past has seen novel views of the phonology ...
متن کاملParalinguistic Phonetics in NLP Models & Methods
Natural language processing (NLP) is gradually becoming a more multidisciplinary field, and research in remotely connected aspects of language such as paralinguistic phonetics may benefit from as well as contribute to some areas of NLP. This paper provides a brief overview of paralinguistic phonetics, and some current NLPrelated methods and models used in TTS and ASR systems today. In order to ...
متن کاملSpeech Synthesis
Speech Synthesis is undoubtedly a technological challenge with many potential applications in human-machine communication. More basically, it is a crossroads where researchers with many different backgrounds collaborate to put together their knowledge in computational linguistics, phonetics, prosody, physiology, vocal tract modeling, signal processing, image synthesis, experimental psychology, ...
متن کاملChapter 21 Part II : Experimental methods and paradigms for prosodic analysis
There is a long tradition of experimental research in the field of prosody, as different aspects of speech production and perception related to prosody have often been part of traditional laboratory and phonetics investigation. However, in recent years, the development of a set of laboratory tools to investigate language performance and its neurocognitive basis has prompted a new experimental a...
متن کامل